Overview

Dataset info

Number of variables62
Number of observations90697
Missing cells832241 (14.8%)
Duplicate rows0 (0.0%)
Total size in memory160.5 MiB
Average record size in memory1.8 KiB

Variables types

CAT36
NUM17
BOOL8
DATE1

Reproduction info

Date of analysis2020-05-18 12:32:59.147027
Versionpandas-profiling v2.4.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download Configurationconfig.yaml

Warnings

Alternative_purchasing_group has 3542 (3.9%) missing values Missing
Alternative_purchasing_group has a high cardinality: 561 distinct values Warning
ATC has a high cardinality: 1111 distinct values Warning
Chemical_/_biological has 7621 (8.4%) missing values Missing
consumption has 7514 (8.3%) zeros Zeros
consumption_previous_year has 3745 (4.1%) zeros Zeros
Current_consumption has 8480 (9.3%) zeros Zeros
Dangerous_material_is_charged_with_a_permit has 80310 (88.5%) missing values Missing
dangerous_substance has 80326 (88.6%) missing values Missing
Date_of_basket_entry has 41213 (45.4%) missing values Missing
Date_of_basket_entry has a high cardinality: 54 distinct values Warning
Description_Alternative_purchasing_group has 3542 (3.9%) missing values Missing
Description_Alternative_purchasing_group has a high cardinality: 561 distinct values Warning
Description_outline has 6891 (7.6%) missing values Missing
Description_outline has a high cardinality: 306 distinct values Warning
European_patent_expires has 89409 (98.6%) missing values Missing
For_adults has 86837 (95.7%) missing values Missing
Form_of_giving has 52945 (58.4%) missing values Missing
GENERY_FATHER has 53724 (59.2%) missing values Missing
GENERY_FATHER has a high cardinality: 680 distinct values Warning
Inventory has 10370 (11.4%) zeros Zeros
Inventory_of_consumption_months is highly skewed (γ1 = 82.270332) Skewed
Inventory_of_consumption_months has 32934 (36.3%) zeros Zeros
Main_outline has 6891 (7.6%) missing values Missing
Main_outline has a high cardinality: 306 distinct values Warning
Narcotic_/_psychotropic has 41313 (45.6%) missing values Missing
Narcotic_/_psychotropic has 46358 (51.1%) zeros Zeros
Plant has constant value "7350.0" Rejected
Prediction has 8934 (9.9%) zeros Zeros
PRICE is highly skewed (γ1 = 24.55626018) Skewed
Price_for_absolute_packaging has 1324 (1.5%) zeros Zeros
Quantity_in_absolute_packaging has 1324 (1.5%) zeros Zeros
Quantity_in_Packaging-Absolute has 1324 (1.5%) zeros Zeros
Quantity_in_packing-relative has 64712 (71.3%) zeros Zeros
Safety_Stock has 9502 (10.5%) zeros Zeros
Send_code_to_Omri has constant value "1.0" Rejected
Serving_form has 7904 (8.7%) missing values Missing
Serving_form has a high cardinality: 53 distinct values Warning
skucode2 has a high cardinality: 3207 distinct values Warning
status has constant value "1.0" Rejected
Toxic_item has 7362 (8.1%) missing values Missing
type_of_packeging has 85285 (94.0%) missing values Missing
U#S#_Patent_Expires has 89166 (98.3%) missing values Missing
Validity_of_Ministry_of_Health_registration has 86155 (95.0%) missing values Missing
Validity_of_Ministry_of_Health_registration has a high cardinality: 83 distinct values Warning
VENDOR has a high cardinality: 155 distinct values Warning
consumption_previous_year is highly correlated with consumption and 2 other fieldsHigh Correlation
consumption is highly correlated with consumption_previous_year and 2 other fieldsHigh Correlation
Current_consumption is highly correlated with consumption and 2 other fieldsHigh Correlation
General_purpose_ is highly correlated with General_purposeHigh Correlation
General_purpose is highly correlated with General_purpose_High Correlation
Prediction is highly correlated with consumption and 2 other fieldsHigh Correlation
Quantity_in_Packaging-Absolute is highly correlated with Quantity_in_absolute_packagingHigh Correlation
Quantity_in_absolute_packaging is highly correlated with Quantity_in_Packaging-AbsoluteHigh Correlation
Safety_Stock is highly correlated with InventoryHigh Correlation
Inventory is highly correlated with Safety_StockHigh Correlation
General_purpose_ is highly correlated with General_purposeHigh Correlation
General_purpose is highly correlated with General_purpose_High Correlation
month_year_ is highly correlated with month_yearHigh Correlation
month_year is highly correlated with month_year_High Correlation

Variables

A_group_of_materials
Real number (ℝ≥0)

Distinct count38
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19380.69801
Minimum1000
Maximum20400
Zeros0
Zeros (%)0.0%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum1000
5-th percentile10170
Q120120
median20160
Q320190
95-th percentile20271
Maximum20400
Range19400
Interquartile range (IQR)70

Descriptive statistics

Standard deviation2749.296447
Coefficient of variation (CV)0.1418574525
Kurtosis9.513002339
Mean19380.69801
Median Absolute Deviation (MAD)1456.125489
Skewness-3.294453069
Sum1757771167
Variance7558630.951
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 1000. 1500. 5000. 9050. 10145. ... 20271.5 20273.5 20274.5 20338. 20400. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
20190 14723 16.2%
 
20100 11235 12.4%
 
20120 9449 10.4%
 
20130 7744 8.5%
 
20210 5920 6.5%
 
20140 5192 5.7%
 
20270 4324 4.8%
 
20220 3800 4.2%
 
20180 3707 4.1%
 
20110 3031 3.3%
 
Other values (28) 21572 23.8%
 
ValueCountFrequency (%) 
1000 112 0.1%
 
2000 2 < 0.1%
 
8000 243 0.3%
 
10100 739 0.8%
 
10120 5 < 0.1%
 
ValueCountFrequency (%) 
20400 75 0.1%
 
20276 799 0.9%
 
20275 9 < 0.1%
 
20274 75 0.1%
 
20273 1389 1.5%
 

ABC
Categorical

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
C
49485
B
30334
A
10878
ValueCountFrequency (%) 
C 49485 54.6%
 
B 30334 33.4%
 
A 10878 12.0%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length1
Mean length1
Min length1
Scatter
Distinct count39
Unique (%)< 0.1%
Missing147
Missing (%)0.2%
Memory size708.6 KiB
20
14504
11
11556
13
9668
24
9137
14
8010
Other values (33)
37675
ValueCountFrequency (%) 
20 14504 16.0%
 
11 11556 12.7%
 
13 9668 10.7%
 
24 9137 10.1%
 
14 8010 8.8%
 
21 6052 6.7%
 
15 5254 5.8%
 
22 4072 4.5%
 
19 3955 4.4%
 
17 2889 3.2%
 
Other values (28) 15453 17.0%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length2.003241563
Min length2
Scatter

Alternative_purchasing_group
Categorical

MISSING
HIGH CARDINALITY
Distinct count561
Unique (%)0.6%
Missing3542
Missing (%)3.9%
Memory size708.6 KiB
1734
 
2963
1591
 
1889
1875
 
1705
1068
 
1515
1823
 
1352
Other values (555)
77731
ValueCountFrequency (%) 
1734 2963 3.3%
 
1591 1889 2.1%
 
1875 1705 1.9%
 
1068 1515 1.7%
 
1823 1352 1.5%
 
1317 1309 1.4%
 
1053 1217 1.3%
 
1642 1027 1.1%
 
1027 965 1.1%
 
1354 952 1.0%
 
Other values (550) 72261 79.7%
 
(Missing) 3542 3.9%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length4
Min length4
Scatter

ATC
Categorical

HIGH CARDINALITY
Distinct count1111
Unique (%)1.2%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
1/0/1900
 
6714
ATC:
 
1096
ATC:M01AE01
 
887
ATC:C10AA07
 
631
ATC:C10AA05
 
622
Other values (1106)
80747
ValueCountFrequency (%) 
1/0/1900 6714 7.4%
 
ATC: 1096 1.2%
 
ATC:M01AE01 887 1.0%
 
ATC:C10AA07 631 0.7%
 
ATC:C10AA05 622 0.7%
 
ATC:N03AX16 618 0.7%
 
ATC:A11AA06 611 0.7%
 
ATC:N06BA04 587 0.6%
 
ATC:A11CC05 545 0.6%
 
ATC:A07FA01 531 0.6%
 
Other values (1101) 77855 85.8%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length11
Mean length10.53277396
Min length4
Scatter
Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
EA
75965
PAC
 
14365
KIT
 
321
BLS
 
24
BT
 
22
ValueCountFrequency (%) 
EA 75965 83.8%
 
PAC 14365 15.8%
 
KIT 321 0.4%
 
BLS 24 < 0.1%
 
BT 22 < 0.1%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length3
Mean length2.162188386
Min length2
Scatter

Budget_Group
Categorical

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
B
43242
E
23065
A
18708
D
 
5580
H
 
102
ValueCountFrequency (%) 
B 43242 47.7%
 
E 23065 25.4%
 
A 18708 20.6%
 
D 5580 6.2%
 
H 102 0.1%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length1
Mean length1
Min length1
Scatter

Chemical_/_biological
Categorical

MISSING
Distinct count3
Unique (%)< 0.1%
Missing7621
Missing (%)8.4%
Memory size708.6 KiB
1
80095
2
 
2981
ValueCountFrequency (%) 
1 80095 88.3%
 
2 2981 3.3%
 
(Missing) 7621 8.4%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length3
Mean length3
Min length3
Scatter

chronic
Boolean

Distinct count3
Unique (%)< 0.1%
Missing428
Missing (%)0.5%
Memory size708.6 KiB
0
51192
1
39077
(Missing)
 
428
ValueCountFrequency (%) 
0 51192 56.4%
 
1 39077 43.1%
 
(Missing) 428 0.5%
 
Distinct count3
Unique (%)< 0.1%
Missing1
Missing (%)< 0.1%
Memory size708.6 KiB
ave
74914
pred
15782
ValueCountFrequency (%) 
ave 74914 82.6%
 
pred 15782 17.4%
 
(Missing) 1 < 0.1%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.174018986
Min length3
Scatter

consumption
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count2224
Unique (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean78024.95626
Minimum0
Maximum4392300
Zeros7514
Zeros (%)8.3%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q1828
median4703
Q347350
95-th percentile376105
Maximum4392300
Range4392300
Interquartile range (IQR)46522

Descriptive statistics

Standard deviation245380.8128
Coefficient of variation (CV)3.144901638
Kurtosis107.022945
Mean78024.95626
Median Absolute Deviation (MAD)108086.0629
Skewness8.428340271
Sum7076629458
Variance6.021174328e+10
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.0000000e+00 5.0000000e-01 4.0000000e+00 6.0000000e+00 1.3500000e+01 ... 1.1802800e+06 1.2108105e+06 1.3336665e+06 1.6117350e+06 4.3923000e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 7514 8.3%
 
1025 133 0.1%
 
1097 126 0.1%
 
384 121 0.1%
 
27290 120 0.1%
 
2427 118 0.1%
 
5688 117 0.1%
 
678 116 0.1%
 
136200 113 0.1%
 
1177 110 0.1%
 
Other values (2214) 82109 90.5%
 
ValueCountFrequency (%) 
0 7514 8.3%
 
1 25 < 0.1%
 
2 20 < 0.1%
 
3 16 < 0.1%
 
5 2 < 0.1%
 
ValueCountFrequency (%) 
4392300 60 0.1%
 
3318000 60 0.1%
 
3239610 60 0.1%
 
1633120 60 0.1%
 
1590350 60 0.1%
 

consumption_previous_year
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count2755
Unique (%)3.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean887840.7473
Minimum0.0
Maximum46044870.0
Zeros3745
Zeros (%)4.1%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile735.4
Q111172
median62310
Q3598950
95-th percentile4533600
Maximum46044870
Range46044870
Interquartile range (IQR)587778

Descriptive statistics

Standard deviation2701115.98
Coefficient of variation (CV)3.042342885
Kurtosis98.28656155
Mean887840.7473
Median Absolute Deviation (MAD)1208727.411
Skewness8.102139676
Sum8.052449226e+10
Variance7.296027539e+12
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.0000000e+00 5.0000000e-01 2.0000000e+00 6.5000000e+00 1.1500000e+01 ... 1.5478775e+07 1.6305360e+07 1.8004425e+07 3.6691470e+07 4.6044870e+07], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 3745 4.1%
 
12687 106 0.1%
 
9622 94 0.1%
 
11371 87 0.1%
 
5770 83 0.1%
 
17716 83 0.1%
 
4532 79 0.1%
 
454770 79 0.1%
 
3530 70 0.1%
 
8341 62 0.1%
 
Other values (2745) 86209 95.1%
 
ValueCountFrequency (%) 
0 3745 4.1%
 
1 59 0.1%
 
3 3 < 0.1%
 
10 50 0.1%
 
13 2 < 0.1%
 
ValueCountFrequency (%) 
46044870 60 0.1%
 
37611900 60 0.1%
 
35771040 60 0.1%
 
19037280 60 0.1%
 
16971570 60 0.1%
 

Current_consumption
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count2212
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean103993.5932
Minimum0
Maximum5666490
Zeros8480
Zeros (%)9.3%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q1969
median5850
Q362790
95-th percentile520530
Maximum5666490
Range5666490
Interquartile range (IQR)61821

Descriptive statistics

Standard deviation322385.0218
Coefficient of variation (CV)3.100046955
Kurtosis101.3871974
Mean103993.5932
Median Absolute Deviation (MAD)144213.3013
Skewness8.168678076
Sum9431906920
Variance1.039321023e+11
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.000000e+00 5.000000e-01 1.500000e+00 2.500000e+00 5.500000e+00 ... 1.677765e+06 1.707915e+06 1.974510e+06 4.293390e+06 5.666490e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 8480 9.3%
 
3355 154 0.2%
 
120 151 0.2%
 
1023 147 0.2%
 
602 142 0.2%
 
180 137 0.2%
 
763 129 0.1%
 
722 119 0.1%
 
3814 118 0.1%
 
1401 117 0.1%
 
Other values (2202) 81003 89.3%
 
ValueCountFrequency (%) 
0 8480 9.3%
 
1 1 < 0.1%
 
2 77 0.1%
 
3 26 < 0.1%
 
4 10 < 0.1%
 
ValueCountFrequency (%) 
5666490 60 0.1%
 
4368000 60 0.1%
 
4218780 60 0.1%
 
2010420 60 0.1%
 
1938600 60 0.1%
 
Distinct count3
Unique (%)< 0.1%
Missing80310
Missing (%)88.5%
Memory size708.6 KiB
0
10058
1
 
329
ValueCountFrequency (%) 
0 10058 11.1%
 
1 329 0.4%
 
(Missing) 80310 88.5%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.656427445
Min length1
Scatter

dangerous_substance
Categorical

MISSING
Distinct count3
Unique (%)< 0.1%
Missing80326
Missing (%)88.6%
Memory size708.6 KiB
0
9727
1
 
644
ValueCountFrequency (%) 
0 9727 10.7%
 
1 644 0.7%
 
(Missing) 80326 88.6%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.65695668
Min length1
Scatter

Date_of_basket_entry
Categorical

MISSING
HIGH CARDINALITY
Distinct count54
Unique (%)0.1%
Missing41213
Missing (%)45.4%
Memory size708.6 KiB
01.01.1995
23781
01.01.2000
 
4298
01.03.2001
 
3333
01.03.1999
 
2760
01.03.2002
 
2118
Other values (48)
13194
ValueCountFrequency (%) 
01.01.1995 23781 26.2%
 
01.01.2000 4298 4.7%
 
01.03.2001 3333 3.7%
 
01.03.1999 2760 3.0%
 
01.03.2002 2118 2.3%
 
01.01.2009 1418 1.6%
 
01.05.2006 1402 1.5%
 
01.03.2008 1070 1.2%
 
15.01.2015 853 0.9%
 
01.01.2010 790 0.9%
 
Other values (43) 7661 8.4%
 
(Missing) 41213 45.4%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length10
Mean length7.273581265
Min length4
Scatter

Description_Alternative_purchasing_group
Categorical

MISSING
HIGH CARDINALITY
Distinct count561
Unique (%)0.6%
Missing3542
Missing (%)3.9%
Memory size708.6 KiB
Hypertension
 
2963
Bact.Inf., Sys.
 
1889
Statines
 
1705
EPILEPSY- Anticonvulsants
 
1515
Pain, Mod/Sev
 
1352
Other values (555)
77731
ValueCountFrequency (%) 
Hypertension 2963 3.3%
 
Bact.Inf., Sys. 1889 2.1%
 
Statines 1705 1.9%
 
EPILEPSY- Anticonvulsants 1515 1.7%
 
Pain, Mod/Sev 1352 1.5%
 
הורדת חום ושיכוך כאב 1309 1.4%
 
Depression-SSRI 1217 1.3%
 
Diabetes Mell. 1027 1.1%
 
BPH 965 1.1%
 
טיפול אנטי פטרייתי 952 1.0%
 
Other values (550) 72261 79.7%
 
(Missing) 3542 3.9%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length30
Mean length15.21452749
Min length2
Scatter

Description_outline
Categorical

MISSING
HIGH CARDINALITY
Distinct count306
Unique (%)0.3%
Missing6891
Missing (%)7.6%
Memory size708.6 KiB
Hypertension
 
3180
Diabetes Mell.
 
3005
food adittive
 
2471
Not Designated
 
2361
Epilepsy
 
2016
Other values (300)
70773
ValueCountFrequency (%) 
Hypertension 3180 3.5%
 
Diabetes Mell. 3005 3.3%
 
food adittive 2471 2.7%
 
Not Designated 2361 2.6%
 
Epilepsy 2016 2.2%
 
Bact.Inf., Sys. 1893 2.1%
 
cosmetics 1868 2.1%
 
Statines 1705 1.9%
 
Antidepressants 1683 1.9%
 
Pain, Mild/Mod. 1602 1.8%
 
Other values (295) 62022 68.4%
 
(Missing) 6891 7.6%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length30
Mean length12.00066154
Min length3
Scatter

df_index
Real number (ℝ≥0)

UNIQUE
Distinct count90697
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54602.10858
Minimum0
Maximum109199
Zeros1
Zeros (%)< 0.1%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile5442.8
Q127270
median54644
Q381884
95-th percentile103723.2
Maximum109199
Range109199
Interquartile range (IQR)54614

Descriptive statistics

Standard deviation31526.67024
Coefficient of variation (CV)0.5773892449
Kurtosis-1.200406177
Mean54602.10858
Median Absolute Deviation (MAD)27307.16505
Skewness-0.001034537182
Sum4952247442
Variance993930936.5
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 109199.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
6141 1 < 0.1%
 
101028 1 < 0.1%
 
60104 1 < 0.1%
 
58057 1 < 0.1%
 
64202 1 < 0.1%
 
62155 1 < 0.1%
 
49869 1 < 0.1%
 
56014 1 < 0.1%
 
8913 1 < 0.1%
 
15058 1 < 0.1%
 
Other values (90687) 90687 > 99.9%
 
ValueCountFrequency (%) 
0 1 < 0.1%
 
1 1 < 0.1%
 
2 1 < 0.1%
 
4 1 < 0.1%
 
5 1 < 0.1%
 
ValueCountFrequency (%) 
109199 1 < 0.1%
 
109198 1 < 0.1%
 
109197 1 < 0.1%
 
109196 1 < 0.1%
 
109195 1 < 0.1%
 
Distinct count879
Unique (%)1.0%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
Minimum2006-03-06 00:00:00
Maximum2019-08-19 00:00:00
Mini histogram
Histogram
Histogram

European_patent_expires
Categorical

MISSING
Distinct count30
Unique (%)< 0.1%
Missing89409
Missing (%)98.6%
Memory size708.6 KiB
01.02.2015
168
01.06.2017
 
99
01.03.2016
 
98
01.10.2018
 
96
01.03.2017
 
81
Other values (24)
746
ValueCountFrequency (%) 
01.02.2015 168 0.2%
 
01.06.2017 99 0.1%
 
01.03.2016 98 0.1%
 
01.10.2018 96 0.1%
 
01.03.2017 81 0.1%
 
01.09.2019 65 0.1%
 
01.12.2022 61 0.1%
 
01.05.2020 60 0.1%
 
01.07.2018 59 0.1%
 
01.05.2019 53 0.1%
 
Other values (19) 448 0.5%
 
(Missing) 89409 98.6%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length10
Mean length4.085206787
Min length4
Scatter
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
0
81017
1
 
9680
ValueCountFrequency (%) 
0 81017 89.3%
 
1 9680 10.7%
 

For_adults
Categorical

MISSING
Distinct count2
Unique (%)< 0.1%
Missing86837
Missing (%)95.7%
Memory size708.6 KiB
1
3860
ValueCountFrequency (%) 
1 3860 4.3%
 
(Missing) 86837 95.7%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.872322128
Min length1
Scatter

Form_of_giving
Categorical

MISSING
Distinct count4
Unique (%)< 0.1%
Missing52945
Missing (%)58.4%
Memory size708.6 KiB
0
37724
17
 
22
15
 
6
ValueCountFrequency (%) 
0 37724 41.6%
 
17 22 < 0.1%
 
15 6 < 0.1%
 
(Missing) 52945 58.4%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length2.751579435
Min length1
Scatter

General_purpose
Categorical

HIGH CORRELATION
HIGH CORRELATION
Distinct count4
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
0
89272
6
 
842
2
 
378
1
 
205
ValueCountFrequency (%) 
0 89272 98.4%
 
6 842 0.9%
 
2 378 0.4%
 
1 205 0.2%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length3
Mean length3
Min length3
Scatter

General_purpose_
Categorical

HIGH CORRELATION
HIGH CORRELATION
Distinct count4
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
0
89272
6
 
842
2
 
378
1
 
205
ValueCountFrequency (%) 
0 89272 98.4%
 
6 842 0.9%
 
2 378 0.4%
 
1 205 0.2%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length3
Mean length3
Min length3
Scatter

GENERY_FATHER
Categorical

MISSING
HIGH CARDINALITY
Distinct count680
Unique (%)0.7%
Missing53724
Missing (%)59.2%
Memory size708.6 KiB
3999001541
 
186
3999001291
 
179
3999019172
 
178
3999001511
 
177
3999001491
 
176
Other values (674)
36077
ValueCountFrequency (%) 
3999001541 186 0.2%
 
3999001291 179 0.2%
 
3999019172 178 0.2%
 
3999001511 177 0.2%
 
3999001491 176 0.2%
 
3999001490 174 0.2%
 
3999001150 171 0.2%
 
3999001542 171 0.2%
 
3999003522 167 0.2%
 
3999001280 165 0.2%
 
Other values (669) 35229 38.8%
 
(Missing) 53724 59.2%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length10
Mean length6.445924341
Min length4
Scatter

Inventory
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count2253
Unique (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean104031.0092
Minimum0
Maximum6462780
Zeros10370
Zeros (%)11.4%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q11251
median5710
Q365750
95-th percentile488730
Maximum6462780
Range6462780
Interquartile range (IQR)64499

Descriptive statistics

Standard deviation362225.5124
Coefficient of variation (CV)3.481899438
Kurtosis134.4122878
Mean104031.0092
Median Absolute Deviation (MAD)144642.8975
Skewness9.625140566
Sum9435300440
Variance1.312073218e+11
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.000000e+00 1.000000e+00 3.500000e+00 1.150000e+01 2.900000e+01 ... 2.462100e+06 2.564640e+06 3.135090e+06 6.400395e+06 6.462780e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 10370 11.4%
 
875 130 0.1%
 
7306 120 0.1%
 
5258 120 0.1%
 
400 120 0.1%
 
6647 120 0.1%
 
2208 117 0.1%
 
41412 116 0.1%
 
74370 115 0.1%
 
2547 114 0.1%
 
Other values (2243) 79255 87.4%
 
ValueCountFrequency (%) 
0 10370 11.4%
 
2 8 < 0.1%
 
5 59 0.1%
 
6 32 < 0.1%
 
7 13 < 0.1%
 
ValueCountFrequency (%) 
6462780 60 0.1%
 
6338010 60 0.1%
 
3606000 60 0.1%
 
2664180 60 0.1%
 
2465100 60 0.1%
 

Inventory_of_consumption_months
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count32
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.324707543
Minimum0
Maximum786
Zeros32934
Zeros (%)36.3%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile3
Maximum786
Range786
Interquartile range (IQR)2

Descriptive statistics

Standard deviation7.409864836
Coefficient of variation (CV)5.593585449
Kurtosis8391.898955
Mean1.324707543
Median Absolute Deviation (MAD)1.183926836
Skewness82.270332
Sum120147
Variance54.90609688
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.00e+00 5.00e-01 1.50e+00 2.50e+00 3.50e+00 ... 2.55e+01 3.20e+01 1.27e+02 2.00e+02 7.86e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 32934 36.3%
 
1 30986 34.2%
 
2 16412 18.1%
 
3 6882 7.6%
 
4 1739 1.9%
 
5 626 0.7%
 
6 244 0.3%
 
7 157 0.2%
 
8 121 0.1%
 
10 104 0.1%
 
Other values (22) 492 0.5%
 
ValueCountFrequency (%) 
0 32934 36.3%
 
1 30986 34.2%
 
2 16412 18.1%
 
3 6882 7.6%
 
4 1739 1.9%
 
ValueCountFrequency (%) 
786 6 < 0.1%
 
255 2 < 0.1%
 
145 18 < 0.1%
 
109 15 < 0.1%
 
98 21 < 0.1%
 
Distinct count3
Unique (%)< 0.1%
Missing155
Missing (%)0.2%
Memory size708.6 KiB
1
56296
0
34246
(Missing)
 
155
ValueCountFrequency (%) 
1 56296 62.1%
 
0 34246 37.8%
 
(Missing) 155 0.2%
 

Item_Type
Categorical

Distinct count5
Unique (%)< 0.1%
Missing405
Missing (%)0.4%
Memory size708.6 KiB
0
89920
1
 
252
6
 
60
3
 
60
ValueCountFrequency (%) 
0 89920 99.1%
 
1 252 0.3%
 
6 60 0.1%
 
3 60 0.1%
 
(Missing) 405 0.4%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length3
Mean length3
Min length3
Scatter

Loading_group
Categorical

Distinct count12
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
23
78836
11
 
3303
25
 
2814
13
 
1700
24
 
1186
Other values (7)
 
2858
ValueCountFrequency (%) 
23 78836 86.9%
 
11 3303 3.6%
 
25 2814 3.1%
 
13 1700 1.9%
 
24 1186 1.3%
 
12 839 0.9%
 
22 748 0.8%
 
15 595 0.7%
 
14 333 0.4%
 
19 206 0.2%
 
Other values (2) 137 0.2%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length2
Mean length1.998544605
Min length1
Scatter

Main_outline
Categorical

MISSING
HIGH CARDINALITY
Distinct count306
Unique (%)0.3%
Missing6891
Missing (%)7.6%
Memory size708.6 KiB
13
 
3180
12
 
3005
393
 
2471
0
 
2361
14
 
2016
Other values (300)
70773
ValueCountFrequency (%) 
13 3180 3.5%
 
12 3005 3.3%
 
393 2471 2.7%
 
0 2361 2.6%
 
14 2016 2.2%
 
60 1893 2.1%
 
392 1868 2.1%
 
432 1705 1.9%
 
420 1683 1.9%
 
54 1602 1.8%
 
Other values (295) 62022 68.4%
 
(Missing) 6891 7.6%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length2.558805694
Min length1
Scatter

Material_Type
Categorical

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
ZHW1
83757
ZHW3
 
6697
ZUB1
 
243
ValueCountFrequency (%) 
ZHW1 83757 92.3%
 
ZHW3 6697 7.4%
 
ZUB1 243 0.3%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length4
Min length4
Scatter

month_year
Categorical

HIGH CORRELATION
Distinct count30
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
07/2018
 
3186
01/2019
 
3150
08/2018
 
3131
10/2018
 
3127
07/2019
 
3114
Other values (25)
74989
ValueCountFrequency (%) 
07/2018 3186 3.5%
 
01/2019 3150 3.5%
 
08/2018 3131 3.5%
 
10/2018 3127 3.4%
 
07/2019 3114 3.4%
 
01/2018 3107 3.4%
 
10/2017 3087 3.4%
 
05/2018 3086 3.4%
 
09/2019 3085 3.4%
 
03/2019 3082 3.4%
 
Other values (20) 59542 65.6%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length7
Mean length7
Min length7
Scatter

month_year_
Categorical

HIGH CORRELATION
Distinct count30
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
07/2018
 
3186
01/2019
 
3150
08/2018
 
3131
10/2018
 
3127
07/2019
 
3114
Other values (25)
74989
ValueCountFrequency (%) 
07/2018 3186 3.5%
 
01/2019 3150 3.5%
 
08/2018 3131 3.5%
 
10/2018 3127 3.4%
 
07/2019 3114 3.4%
 
01/2018 3107 3.4%
 
10/2017 3087 3.4%
 
05/2018 3086 3.4%
 
09/2019 3085 3.4%
 
03/2019 3082 3.4%
 
Other values (20) 59542 65.6%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length7
Mean length7
Min length7
Scatter
Distinct count3
Unique (%)< 0.1%
Missing56
Missing (%)0.1%
Memory size708.6 KiB
1
57411
0
33230
(Missing)
 
56
ValueCountFrequency (%) 
1 57411 63.3%
 
0 33230 36.6%
 
(Missing) 56 0.1%
 

Narcotic_/_psychotropic
Real number (ℝ≥0)

MISSING
ZEROS
Distinct count6
Unique (%)< 0.1%
Missing41313
Missing (%)45.6%
Infinite0
Infinite (%)0.0%
Mean0.1018751012
Minimum0.0
Maximum5.0
Zeros46358
Zeros (%)51.1%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum5
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4602865691
Coefficient of variation (CV)4.518145881
Kurtosis39.7921035
Mean0.1018751012
Median Absolute Deviation (MAD)0.1912654278
Skewness5.756198531
Sum5031
Variance0.2118637257
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 46358 51.1%
 
1 1700 1.9%
 
2 839 0.9%
 
3 391 0.4%
 
5 96 0.1%
 
(Missing) 41313 45.6%
 
ValueCountFrequency (%) 
0 46358 51.1%
 
1 1700 1.9%
 
2 839 0.9%
 
3 391 0.4%
 
5 96 0.1%
 
ValueCountFrequency (%) 
5 96 0.1%
 
3 391 0.4%
 
2 839 0.9%
 
1 1700 1.9%
 
0 46358 51.1%
 

Plant
Categorical

CONST
Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
7350
90697
ValueCountFrequency (%) 
7350 90697 100.0%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length6
Mean length6
Min length6
Scatter

Prediction
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count2163
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean75058.45476
Minimum0
Maximum4302782
Zeros8934
Zeros (%)9.9%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q1744
median4390
Q346088
95-th percentile364336
Maximum4302782
Range4302782
Interquartile range (IQR)45344

Descriptive statistics

Standard deviation233217.7711
Coefficient of variation (CV)3.10714858
Kurtosis109.8697089
Mean75058.45476
Median Absolute Deviation (MAD)103912.464
Skewness8.449247232
Sum6807576671
Variance5.439052875e+10
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.0000000e+00 5.0000000e-01 5.5000000e+00 1.6000000e+01 1.7500000e+01 ... 1.1548035e+06 1.2405160e+06 1.2994585e+06 1.5776160e+06 4.3027820e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 8934 9.9%
 
627 159 0.2%
 
465 141 0.2%
 
1466 139 0.2%
 
1294 137 0.2%
 
471 130 0.1%
 
598 125 0.1%
 
3785 120 0.1%
 
4855 120 0.1%
 
6903 120 0.1%
 
Other values (2153) 80572 88.8%
 
ValueCountFrequency (%) 
0 8934 9.9%
 
1 2 < 0.1%
 
3 1 < 0.1%
 
4 1 < 0.1%
 
5 1 < 0.1%
 
ValueCountFrequency (%) 
4302782 60 0.1%
 
3093749 60 0.1%
 
2913005 60 0.1%
 
1601720 60 0.1%
 
1553512 60 0.1%
 

PRICE
Real number (ℝ≥0)

SKEWED
Distinct count1453
Unique (%)1.6%
Missing4
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean40.75922585
Minimum0.01
Maximum13106.76
Zeros0
Zeros (%)0.0%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0.01
5-th percentile0.14
Q10.48
median4.8
Q316.42
95-th percentile78.85
Maximum13106.76
Range13106.75
Interquartile range (IQR)15.94

Descriptive statistics

Standard deviation330.9380054
Coefficient of variation (CV)8.119339818
Kurtosis784.5144498
Mean40.75922585
Median Absolute Deviation (MAD)59.68374267
Skewness24.55626018
Sum3696576.47
Variance109519.9634
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0.14 1143 1.3%
 
0.16 893 1.0%
 
0.15 881 1.0%
 
0.33 873 1.0%
 
0.2 862 1.0%
 
0.29 854 0.9%
 
0.11 846 0.9%
 
0.51 762 0.8%
 
11.7 733 0.8%
 
0.1 715 0.8%
 
Other values (1442) 82131 90.6%
 
ValueCountFrequency (%) 
0.01 173 0.2%
 
0.03 195 0.2%
 
0.04 151 0.2%
 
0.05 210 0.2%
 
0.06 177 0.2%
 
ValueCountFrequency (%) 
13106.76 1 < 0.1%
 
12471.82 28 < 0.1%
 
10466.91 4 < 0.1%
 
9371.7 5 < 0.1%
 
8226.27 2 < 0.1%
 

Price_for_absolute_packaging
Real number (ℝ≥0)

ZEROS
Distinct count1879
Unique (%)2.1%
Missing4
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean308.4209645
Minimum0.0
Maximum49409.36
Zeros1324
Zeros (%)1.5%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile2.22
Q17.6
median16.38
Q371.37
95-th percentile1532.16
Maximum49409.36
Range49409.36
Interquartile range (IQR)63.77

Descriptive statistics

Standard deviation1365.125041
Coefficient of variation (CV)4.426174606
Kurtosis197.0158618
Mean308.4209645
Median Absolute Deviation (MAD)474.9928068
Skewness11.23716775
Sum27971622.53
Variance1863566.377
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 1324 1.5%
 
4.8 1074 1.2%
 
4.2 922 1.0%
 
15.3 635 0.7%
 
3.3 611 0.7%
 
8.7 599 0.7%
 
6 583 0.6%
 
11.7 530 0.6%
 
5.7 481 0.5%
 
3 480 0.5%
 
Other values (1868) 83454 92.0%
 
ValueCountFrequency (%) 
0 1324 1.5%
 
0.03 52 0.1%
 
0.07 60 0.1%
 
0.09 75 0.1%
 
0.1 179 0.2%
 
ValueCountFrequency (%) 
49409.36 4 < 0.1%
 
41212.53 1 < 0.1%
 
35099.96 5 < 0.1%
 
30433.62 1 < 0.1%
 
27037.8 7 < 0.1%
 

Pure_OTC
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
0
67205
1
23492
ValueCountFrequency (%) 
0 67205 74.1%
 
1 23492 25.9%
 

Quantity_in_absolute_packaging
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count54
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23.13395151
Minimum0.0
Maximum448.0
Zeros1324
Zeros (%)1.5%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile1
Q11
median20
Q330
95-th percentile90
Maximum448
Range448
Interquartile range (IQR)29

Descriptive statistics

Standard deviation28.59908774
Coefficient of variation (CV)1.236238769
Kurtosis18.16234971
Mean23.13395151
Median Absolute Deviation (MAD)20.06459564
Skewness2.834172256
Sum2098180
Variance817.9078198
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 207. 217. 335. 424. 448. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 32341 35.7%
 
30 20321 22.4%
 
28 6078 6.7%
 
60 4679 5.2%
 
20 4570 5.0%
 
100 3030 3.3%
 
10 2182 2.4%
 
50 2030 2.2%
 
5 1623 1.8%
 
56 1516 1.7%
 
Other values (44) 12327 13.6%
 
ValueCountFrequency (%) 
0 1324 1.5%
 
1 32341 35.7%
 
2 1026 1.1%
 
3 663 0.7%
 
4 1165 1.3%
 
ValueCountFrequency (%) 
448 1 < 0.1%
 
400 36 < 0.1%
 
270 3 < 0.1%
 
224 3 < 0.1%
 
210 30 < 0.1%
 

Quantity_in_Packaging-Absolute
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count54
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23.13395151
Minimum0
Maximum448
Zeros1324
Zeros (%)1.5%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile1
Q11
median20
Q330
95-th percentile90
Maximum448
Range448
Interquartile range (IQR)29

Descriptive statistics

Standard deviation28.59908774
Coefficient of variation (CV)1.236238769
Kurtosis18.16234971
Mean23.13395151
Median Absolute Deviation (MAD)20.06459564
Skewness2.834172256
Sum2098180
Variance817.9078198
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 207. 217. 335. 424. 448. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 32341 35.7%
 
30 20321 22.4%
 
28 6078 6.7%
 
60 4679 5.2%
 
20 4570 5.0%
 
100 3030 3.3%
 
10 2182 2.4%
 
50 2030 2.2%
 
5 1623 1.8%
 
56 1516 1.7%
 
Other values (44) 12327 13.6%
 
ValueCountFrequency (%) 
0 1324 1.5%
 
1 32341 35.7%
 
2 1026 1.1%
 
3 663 0.7%
 
4 1165 1.3%
 
ValueCountFrequency (%) 
448 1 < 0.1%
 
400 36 < 0.1%
 
270 3 < 0.1%
 
224 3 < 0.1%
 
210 30 < 0.1%
 

Quantity_in_packing-relative
Real number (ℝ≥0)

ZEROS
Distinct count65
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24.70324267
Minimum0
Maximum1000
Zeros64712
Zeros (%)71.3%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q35
95-th percentile120
Maximum1000
Range1000
Interquartile range (IQR)5

Descriptive statistics

Standard deviation82.22988555
Coefficient of variation (CV)3.328708164
Kurtosis42.04184641
Mean24.70324267
Median Absolute Deviation (MAD)39.00106848
Skewness5.690316125
Sum2240510
Variance6761.754077
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.00e+00 5.00e-01 2.50e+00 3.50e+00 4.50e+00 ... 4.52e+02 4.77e+02 6.25e+02 8.75e+02 1.00e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 64712 71.3%
 
15 3022 3.3%
 
100 2390 2.6%
 
10 2085 2.3%
 
50 1960 2.2%
 
30 1905 2.1%
 
5 1510 1.7%
 
20 1174 1.3%
 
3 1057 1.2%
 
500 992 1.1%
 
Other values (55) 9890 10.9%
 
ValueCountFrequency (%) 
0 64712 71.3%
 
1 863 1.0%
 
2 803 0.9%
 
3 1057 1.2%
 
4 145 0.2%
 
ValueCountFrequency (%) 
1000 115 0.1%
 
750 58 0.1%
 
500 992 1.1%
 
454 51 0.1%
 
450 121 0.1%
 

Safety_Stock
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count2132
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean72403.71368
Minimum0
Maximum5242283
Zeros9502
Zeros (%)10.5%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q1691
median3116
Q340883
95-th percentile295952
Maximum5242283
Range5242283
Interquartile range (IQR)40192

Descriptive statistics

Standard deviation288122.2307
Coefficient of variation (CV)3.979384704
Kurtosis149.4994849
Mean72403.71368
Median Absolute Deviation (MAD)103665.7207
Skewness10.41643199
Sum6566799620
Variance8.301441984e+10
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.000000e+00 5.000000e-01 3.500000e+00 4.500000e+00 6.500000e+00 ... 1.718314e+06 1.769189e+06 2.945358e+06 5.168751e+06 5.242283e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 9502 10.5%
 
1235 136 0.1%
 
1392 130 0.1%
 
461 126 0.1%
 
736 122 0.1%
 
1796 121 0.1%
 
1635 120 0.1%
 
49325 120 0.1%
 
594 119 0.1%
 
82098 119 0.1%
 
Other values (2122) 80082 88.3%
 
ValueCountFrequency (%) 
0 9502 10.5%
 
1 19 < 0.1%
 
3 33 < 0.1%
 
4 1 < 0.1%
 
5 21 < 0.1%
 
ValueCountFrequency (%) 
5242283 60 0.1%
 
5095219 60 0.1%
 
3274632 60 0.1%
 
2616084 60 0.1%
 
2266471 60 0.1%
 
Distinct count3
Unique (%)< 0.1%
Missing11
Missing (%)< 0.1%
Memory size708.6 KiB
no
76674
yes
 
14012
(Missing)
 
11
ValueCountFrequency (%) 
no 76674 84.5%
 
yes 14012 15.4%
 
(Missing) 11 < 0.1%
 

Send_code_to_Omri
Boolean

CONST
Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
1
90697
ValueCountFrequency (%) 
1 90697 100.0%
 

Serving_form
Categorical

MISSING
HIGH CARDINALITY
Distinct count53
Unique (%)0.1%
Missing7904
Missing (%)8.7%
Memory size708.6 KiB
TAB
37094
CAP
8375
CR
 
4923
COL
 
3135
CPL
 
2565
Other values (47)
26701
ValueCountFrequency (%) 
TAB 37094 40.9%
 
CAP 8375 9.2%
 
CR 4923 5.4%
 
COL 3135 3.5%
 
CPL 2565 2.8%
 
GEL 2051 2.3%
 
OIN 1977 2.2%
 
SOL 1789 2.0%
 
INJ 1639 1.8%
 
LIQ 1592 1.8%
 
Other values (42) 17653 19.5%
 
(Missing) 7904 8.7%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.03286768
Min length2
Scatter

skucode2
Categorical

HIGH CARDINALITY
Distinct count3207
Unique (%)3.5%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
10000005695
 
60
10000003201
 
60
10000005712
 
60
10000005869
 
60
10000006644
 
60
Other values (3202)
90397
ValueCountFrequency (%) 
10000005695 60 0.1%
 
10000003201 60 0.1%
 
10000005712 60 0.1%
 
10000005869 60 0.1%
 
10000006644 60 0.1%
 
10000005390 60 0.1%
 
10000006022 60 0.1%
 
10000005088 60 0.1%
 
10000005346 60 0.1%
 
10000005221 60 0.1%
 
Other values (3197) 90097 99.3%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length11
Mean length11
Min length11
Scatter

Sourcing_source
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
ארץ
90143
חו"ל
 
554
ValueCountFrequency (%) 
ארץ 90143 99.4%
 
חו"ל 554 0.6%
 

Composition

Contains charsFalse
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length4
Mean length3.006108251
Min length3
Scatter

status
Boolean

CONST
Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
1
90697
ValueCountFrequency (%) 
1 90697 100.0%
 

storecode
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
3224
46054
3315
44643
ValueCountFrequency (%) 
3224 46054 50.8%
 
3315 44643 49.2%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length6
Mean length6
Min length6
Scatter

Toxic_item
Categorical

MISSING
Distinct count5
Unique (%)< 0.1%
Missing7362
Missing (%)8.1%
Memory size708.6 KiB
0
82932
1
 
333
3
 
55
2
 
15
ValueCountFrequency (%) 
0 82932 91.4%
 
1 333 0.4%
 
3 55 0.1%
 
2 15 < 0.1%
 
(Missing) 7362 8.1%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length3
Mean length3
Min length3
Scatter

TranQuantity
Real number (ℝ≥0)

Distinct count1943
Unique (%)2.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean196.9286195
Minimum0
Maximum12780
Zeros447
Zeros (%)0.5%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile1
Q13
median20
Q3129
95-th percentile955.2
Maximum12780
Range12780
Interquartile range (IQR)126

Descriptive statistics

Standard deviation591.2900672
Coefficient of variation (CV)3.002560363
Kurtosis91.31557023
Mean196.9286195
Median Absolute Deviation (MAD)262.7721293
Skewness7.8744681
Sum17860835
Variance349623.9436
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.0000e+00 5.0000e-01 1.5000e+00 2.5000e+00 3.5000e+00 ... 3.7325e+03 4.6850e+03 7.8750e+03 1.0565e+04 1.2780e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 10802 11.9%
 
2 7535 8.3%
 
3 5312 5.9%
 
4 4111 4.5%
 
60 3589 4.0%
 
30 3294 3.6%
 
5 2936 3.2%
 
6 2478 2.7%
 
120 2067 2.3%
 
90 1843 2.0%
 
Other values (1933) 46730 51.5%
 
ValueCountFrequency (%) 
0 447 0.5%
 
1 10802 11.9%
 
2 7535 8.3%
 
3 5312 5.9%
 
4 4111 4.5%
 
ValueCountFrequency (%) 
12780 2 < 0.1%
 
12420 1 < 0.1%
 
12240 1 < 0.1%
 
11460 2 < 0.1%
 
11264 1 < 0.1%
 

type_of_packeging
Categorical

MISSING
Distinct count7
Unique (%)< 0.1%
Missing85285
Missing (%)94.0%
Memory size708.6 KiB
BOX
3632
TUB
773
BOT
770
PKG
 
200
BAG
 
24
ValueCountFrequency (%) 
BOX 3632 4.0%
 
TUB 773 0.9%
 
BOT 770 0.8%
 
PKG 200 0.2%
 
BAG 24 < 0.1%
 
KIT 13 < 0.1%
 
(Missing) 85285 94.0%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.940328787
Min length3
Scatter

U#S#_Patent_Expires
Categorical

MISSING
Distinct count33
Unique (%)< 0.1%
Missing89166
Missing (%)98.3%
Memory size708.6 KiB
01.03.2016
 
132
01.02.2018
 
117
01.10.2020
 
113
01.07.2018
 
107
01.10.2018
 
96
Other values (27)
966
ValueCountFrequency (%) 
01.03.2016 132 0.1%
 
01.02.2018 117 0.1%
 
01.10.2020 113 0.1%
 
01.07.2018 107 0.1%
 
01.10.2018 96 0.1%
 
01.05.2020 86 0.1%
 
01.03.2017 81 0.1%
 
01.12.2019 71 0.1%
 
01.03.2020 66 0.1%
 
01.06.2018 65 0.1%
 
Other values (22) 597 0.7%
 
(Missing) 89166 98.3%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length10
Mean length4.101282292
Min length4
Scatter

Validity_of_Ministry_of_Health_registration
Categorical

MISSING
HIGH CARDINALITY
Distinct count83
Unique (%)0.1%
Missing86155
Missing (%)95.0%
Memory size708.6 KiB
28.02.2015
 
250
30.04.2015
 
247
30.11.2012
 
244
30.11.2013
 
181
31.01.2007
 
180
Other values (77)
3440
ValueCountFrequency (%) 
28.02.2015 250 0.3%
 
30.04.2015 247 0.3%
 
30.11.2012 244 0.3%
 
30.11.2013 181 0.2%
 
31.01.2007 180 0.2%
 
30.09.2012 142 0.2%
 
31.10.2013 120 0.1%
 
31.03.2015 119 0.1%
 
31.01.2008 113 0.1%
 
31.05.2014 113 0.1%
 
Other values (72) 2833 3.1%
 
(Missing) 86155 95.0%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length10
Mean length4.300473004
Min length4
Scatter

VENDOR
Categorical

HIGH CARDINALITY
Distinct count155
Unique (%)0.2%
Missing594
Missing (%)0.7%
Memory size708.6 KiB
400059
 
8701
400057
 
4983
408668
 
4824
400095
 
4199
400045
 
3597
Other values (149)
63799
ValueCountFrequency (%) 
400059 8701 9.6%
 
400057 4983 5.5%
 
408668 4824 5.3%
 
400095 4199 4.6%
 
400045 3597 4.0%
 
406885 3543 3.9%
 
407214 2828 3.1%
 
400026 2778 3.1%
 
410232 2705 3.0%
 
400032 2203 2.4%
 
Other values (144) 49742 54.8%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length6
Mean length5.986901441
Min length4
Scatter

volume
Real number (ℝ≥0)

Distinct count739
Unique (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean195.6533072
Minimum0
Maximum9973
Zeros127
Zeros (%)0.1%
Memory size708.6 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile3
Q16
median82
Q3244
95-th percentile741
Maximum9973
Range9973
Interquartile range (IQR)238

Descriptive statistics

Standard deviation368.7031077
Coefficient of variation (CV)1.884471635
Kurtosis73.50751411
Mean195.6533072
Median Absolute Deviation (MAD)205.7398271
Skewness6.197226803
Sum17745168
Variance135941.9817
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.0000e+00 5.0000e-01 1.5000e+00 2.5000e+00 3.5000e+00 ... 4.2755e+03 4.3515e+03 4.4655e+03 6.4060e+03 9.9730e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3 7230 8.0%
 
4 5559 6.1%
 
5 4581 5.1%
 
6 4036 4.4%
 
7 3087 3.4%
 
2 3075 3.4%
 
9 2316 2.6%
 
8 2144 2.4%
 
10 1329 1.5%
 
11 1087 1.2%
 
Other values (729) 56253 62.0%
 
ValueCountFrequency (%) 
0 127 0.1%
 
1 947 1.0%
 
2 3075 3.4%
 
3 7230 8.0%
 
4 5559 6.1%
 
ValueCountFrequency (%) 
9973 2 < 0.1%
 
8258 11 < 0.1%
 
4554 60 0.1%
 
4377 17 < 0.1%
 
4326 54 0.1%
 

Volume_marker
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size708.6 KiB
LOW
57339
HIGH
33358
ValueCountFrequency (%) 
LOW 57339 63.2%
 
HIGH 33358 36.8%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.367796068
Min length3
Scatter

Correlations

Missing values

Sample

First rows

A_group_of_materialsABCAffiliation_GroupAlternative_purchasing_groupATCBasic_unit_of_measureBudget_GroupChemical_/_biologicalchronicConsumer_managementconsumptionconsumption_previous_yearCurrent_consumptionDangerous_material_is_charged_with_a_permitdangerous_substanceDate_of_basket_entryDescription_Alternative_purchasing_groupDescription_outlinedf_indexEstablishment_date_of_the_itemEuropean_patent_expiresExceptional_policyFor_adultsForm_of_givingGeneral_purposeGeneral_purpose_GENERY_FATHERInventoryInventory_of_consumption_monthsItem_in_health_basketItem_TypeLoading_groupMain_outlineMaterial_Typemonth_yearmonth_year_Must_prescribeNarcotic_/_psychotropicPlantPredictionPRICEPrice_for_absolute_packagingPure_OTCQuantity_in_absolute_packagingQuantity_in_Packaging-AbsoluteQuantity_in_packing-relativeSafety_StockSeasonalitySend_code_to_OmriServing_formskucode2Sourcing_sourcestatusstorecodeToxic_itemTranQuantitytype_of_packegingU#S#_Patent_ExpiresValidity_of_Ministry_of_Health_registrationVENDORvolumeVolume_marker
020140B151180ATC:G03AA09PACB1.00.0ave636983062.08491NoneNone01.04.2005Oral contraceptiveOral Contra.02006-03-06None0.0None00.00.0None881111.00.023171ZHW103/201703/20171.00.07350.073519.36196.560.021.02104671no1.0TAB10000005719ארץ1.03224.00.011BOXNoneNone408668101LOW
120190C201054ATC:N06AA04EAB1.01.0ave26450306420.034920NoneNone01.01.1995Depression-TricyclicAntidepressants12006-03-06None0.0None00.00.039990054226855021.00.023420ZHW102/201902/20191.00.07350.0253120.216.300.030.030031010no1.0TAB10000006427ארץ1.03224.00.0120NoneNoneNone4000953LOW
220270C241351ATC:D08AX08EAE1.00.0ave132113544.0222000Noneחיטוי ידייםcosmetics22016-05-18None0.0NoneNone0.00.0None87500.00.023392ZHW107/201707/20170.0NaN7350.0123114.0414.041.01.01500924no1.0GEL10000001880ארץ1.03315.00.02NoneNoneNone124207941HIGH
320180C191156ATC:M01AH01EAB1.00.0ave47073525410.059330NoneNoneNoneNSAIDS-COX 2 inhibtorsRheum. Arth.42006-03-06None0.0None00.00.039990106558739010.00.02365ZHW102/201702/20171.00.07350.0453730.565.600.010.010053858no1.0CAP10000006732ארץ1.03224.00.030NoneNoneNone40007710LOW
420120B131605ATC:C08CA01EAB1.01.0ave00.00NoneNone01.01.2000Calcium Chanell BlockersCalcium Chanell Blockers52006-09-26None0.0NoneNone0.00.03999019172001.00.023431ZHW104/201704/20171.00.07350.000.142.800.020.02000no1.0TAB10000004522ארץ1.03315.00.03220NoneNoneNone4000596HIGH
510150C72NoneZRP:102UB01EADNaN1.0ave00.00NoneNone01.03.2001NoneNone62009-06-03None0.0NoneNone0.00.0None001.00.023NoneZHW303/201903/20191.0NaN7350.0032.830.000.00.0000no1.0None10000003897ארץ1.03315.0NaN4NoneNoneNone400044548HIGH
620220C221946ATC:S01XA30PACE1.00.0ave142917751.01852NoneNoneNoneטיפות ליובש בעינייםArtificial Tears72010-12-27None0.0NoneNone0.00.0None295220.00.023128ZHW105/201805/20180.0NaN7350.0149025.74823.681.032.03201544no1.0COL10000003478ארץ1.03315.00.010NoneNoneNone400006505LOW
720100B111354ATC:A01AB09EAB1.00.0ave348039454.04049NoneNone01.01.1995טיפול אנטי פטרייתיFungal Inf, Oroph.82006-03-06None0.0None00.00.0None251001.00.02383ZHW101/201801/20180.00.07350.0325513.0613.060.01.01401740no1.0GEL10000006295ארץ1.03315.00.04NoneNone30.04.2012117469214LOW
810100C241429ZRP:A21ADEAENaN0.0ave5145821.0712NoneNoneNoneסד תמיכה לידNone92006-03-06None0.0NoneNone0.00.0None128220.00.023NoneZHW305/201805/20180.0NaN7350.048229.2529.251.01.010607no1.0None10000006992ארץ1.03224.0NaN2NoneNoneNone400123773LOW
920190C201017ATC:N07CA02EAB1.01.0ave1829422158850.0223275NoneNone01.01.1995Antivertigo preparationsMeniere's Dis.112006-03-06None0.0None00.00.0None27932511.00.023209ZHW106/201706/20171.00.07350.01774170.205.000.025.0250128059no1.0TAB10000006644ארץ1.03224.00.0575NoneNone30.09.20124019747HIGH

Last rows

A_group_of_materialsABCAffiliation_GroupAlternative_purchasing_groupATCBasic_unit_of_measureBudget_GroupChemical_/_biologicalchronicConsumer_managementconsumptionconsumption_previous_yearCurrent_consumptionDangerous_material_is_charged_with_a_permitdangerous_substanceDate_of_basket_entryDescription_Alternative_purchasing_groupDescription_outlinedf_indexEstablishment_date_of_the_itemEuropean_patent_expiresExceptional_policyFor_adultsForm_of_givingGeneral_purposeGeneral_purpose_GENERY_FATHERInventoryInventory_of_consumption_monthsItem_in_health_basketItem_TypeLoading_groupMain_outlineMaterial_Typemonth_yearmonth_year_Must_prescribeNarcotic_/_psychotropicPlantPredictionPRICEPrice_for_absolute_packagingPure_OTCQuantity_in_absolute_packagingQuantity_in_Packaging-AbsoluteQuantity_in_packing-relativeSafety_StockSeasonalitySend_code_to_OmriServing_formskucode2Sourcing_sourcestatusstorecodeToxic_itemTranQuantitytype_of_packegingU#S#_Patent_ExpiresValidity_of_Ministry_of_Health_registrationVENDORvolumeVolume_marker
9068720130C141517ATC:D08AX01EAB1.0NaNave251428150.03399NoneNone01.01.1995תכשיר לחיטויAntiseptic1091892014-08-24None0.0NoneNone0.00.0None340911.00.023135ZHW102/201702/2017NaNNaN7350.024143.283.280.01.011001760no1.0SOL10000002477ארץ1.03315.00.02NoneNoneNone400171362HIGH
9068820130C141519ATC:D02AC02EAE1.00.0pred9009426.01085NoneNoneNoneתכשיר רחצה טיפולי - תינוקותDry Skin1091902010-06-09None0.0NoneNone0.00.0None156310.00.025142ZHW106/201806/20180.0NaN7350.095130.1430.141.01.01500760yes1.0OIL10000003622ארץ1.03315.00.02NoneNoneNone400044985HIGH
9068920160A171651ATC:J07AJ52EAB2.00.0pred00.00NoneNoneNoneDPT Vac.DPT Vac.1091912011-03-22None1.0NoneNone0.00.03999002140001.00.01196ZHW101/201701/20171.0NaN7350.0063.1863.180.01.0100no1.0SRG10000003406ארץ1.03315.00.00NoneNoneNone400164159LOW
9069020240B241401ATC:V06DB15EAB1.00.0ave15000196110.019770NoneNone01.03.2008מזון ייעודיMetab. Disord.1091922014-11-04None0.0NoneNone0.00.0None1086001.00.025107ZHW104/201904/20190.0NaN7350.0163359.019.011.01.012208916no1.0LIQ10000002408ארץ1.03315.00.060NoneNoneNone401882465HIGH
9069120180B191076ATC:M04AC01EAB1.01.0ave4632605311080.0619170NoneNone01.01.1995FMFGout1091932006-03-06None0.0None00.00.0None33807001.00.02375ZHW101/201701/20171.05.07350.04510810.3510.500.030.0300231630no1.0TAB10000006474ארץ1.03224.00.01300NoneNoneNone4000452HIGH
9069220190C201823ATC:N02AA05EAB1.00.0ave6938445.0861NoneNoneNonePain, Mod/SevPain, Mod/Sev1091952013-10-14None0.0NoneNone0.00.0None81210.00.01359ZHW103/201803/20181.01.07350.073132.7632.760.01.0130778no1.0SYR10000002704ארץ1.03315.00.04NoneNoneNone400045168LOW
9069320130C141352ATC:D08AG02EAB1.00.0pred611274934.06830NoneNone01.01.1995חיטוי נגעים בעורAntiseptic1091962006-03-06None0.0None00.00.03999021150525801.00.023135ZHW103/201803/20180.00.07350.049243.513.510.01.01153447yes1.0OIN10000006784ארץ1.03315.00.019NoneNoneNone400057138HIGH
9069420100C111375ATC:A07BB01EAB1.00.0ave69973637500.0116040NoneNone01.01.1995טיפול בשלשול וכאב בטןDiarrhea1091972013-07-01None0.0NoneNone0.00.039990025409996011.00.02347ZHW106/201906/20190.0NaN7350.0534660.295.800.020.020081947no1.0TAB10000002793ארץ1.03315.00.0240NoneNoneNone4000266LOW
9069520150C161694ATC:H02AB07EAB1.01.0ave1694001913000.0221300NoneNone01.01.1995GlucocorticoidGlucocorticoid1091982006-03-06None0.0None00.00.0None20290011.00.023141ZHW109/201709/20171.00.07350.01620850.1515.000.0100.01000180690no1.0TAB10000006612ארץ1.03224.00.0300NoneNoneNone4000570LOW
9069620220B221082ATC:S01EA05EAA1.01.0ave9242108414.012071NoneNone01.10.2005Glaucoma alpha 2 agonistGlaucoma1091992006-03-06None0.0None00.00.03999001444635901.00.02315ZHW109/201909/20191.00.07350.0878316.3816.380.01.0154621no1.0COL10000005869ארץ1.03315.00.040NoneNoneNone40003682LOW